Document Clustering in Forensic Investigation by Hybrid Approach
نویسندگان
چکیده
منابع مشابه
FP-GROWTH APPROACH FOR DOCUMENT CLUSTERING by
Since the amount of text data stored in computer repositories is growing every day, we need more than ever a reliable way to group or categorize text documents. Most of the existing document clustering techniques use a group of keywords from each document to cluster the documents. In this thesis, we have used a sense based approach to cluster documents instead of using only the frequency of the...
متن کاملDocument Clustering using Hybrid ACO-TS
The Information Retrieval (IR) system is facing lot of challenges due to the widespread usage of computers for mass storage and the availability of tremendous information in World Wide Web. Clustering of documents available improves the efficiency of IR system. The problem of clustering has become a combinatorial optimization problem in IR system due to the exponential growth in information ove...
متن کاملAn efficient hybrid distributed document clustering algorithm
Recent advances in information technology have led to an increase in volumes of data thereby exceeding beyond petabytes. Clustering distributed document sets from a central location is difficult due to the massive demand of computational resources. So there is a need for distributed document clustering algorithms to cluster documents using distributed resources. The greatest challenge in this a...
متن کاملSubject-based semantic document clustering for digital forensic investigations
Computers are increasingly used as tools to commit crimes such as unauthorized access (hacking), drug trafficking, and child pornography. The proliferation of crimes involving computers has created a demand for special forensic tools that allow investigators to look for evidence on a suspect’s computer by analyzing communications and data on the computer’s storage devices. Motivated by the fore...
متن کاملOptimum Cluster Labeling and Document Clustering for Forensic Analysis
Document clustering or unsupervised document classification is an automated process of grouping documents with similar content. Document clustering is an important task in many Information Retrieval systems. Also document clustering Algorithms can help in discovery of new and useful knowledge or novel class from the documents under analysis. This knowledge or novel class is very important issue...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computer Applications
سال: 2014
ISSN: 0975-8887
DOI: 10.5120/15860-4784